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Abstract 

A fading broadcast channel is considered where the transmitter employs two 
antennas and each of the two receivers employs a single receive antenna. It is 
demonstrated that even if the realization of the fading is precisely known to the re- 
ceivers, the high signal-to-noise (SNR) throughput is greatly reduced if, rather than 
knowing the fading realization precisely, the trasmitter only knows the fading real- 
ization approximately. The results are general and are not limited to memoryless 
Gaussian fading. 

1 Introduction 

It is quite remarkable that if the transmitter and the receivers in a multi-antenna Gaussian 
fading broadcast channel are cognizant of the precise realization of the fading process, 
then the limiting ratio, as the signal-to-noise ratio (SNR) tends to infinity, of the sum- 
rate capacity of the broadcast channel to the capacity of the single-user channel that 
results when the receivers are allowed to cooperate is one. Thus, if the fading realization 
is perfectly known at all terminals, then no asymptotic loss is incurred due to the lack 
of receiver cooperation. Here we investigate this limiting ratio in the more realistic case 
where the transmitter has only an approximate estimate of the fading realization. We 
show that the broadcast channel's sum-rate capacity suffers from this inaccuracy far 
more than the single-user's channel capacity. Indeed, in a broadcast channel where the 
transmitter employs two transmit antennas and each of the two receivers employs a single 
antenna, if the transmitter only has an approximate estimate of the fading realization, 
then this limiting ratio is upper bounded by 2/3; see Theorem ^ Thus the price of using 
a broadcast channel where the receivers cannot cooperate is in the lack of robustness 
with respect to the precision with which the transmitter knows the fading realizations. 

It is interesting to note that in the single-user channel the asymptotic capacity is not 
sensitive to this precision at all. Indeed, the high SNR asymptotic ratio of the single- 
user channel capacity in the absence of any transmitter information about the fading 
realization to the capacity when the fading realization is known precisely is typically one. 
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It should be emphasized that throughout this paper we assume that the receivers have 
precise knowledge of the fading realizations. Otherwise, both the sum-rate capacity of 
the broadcast channel and the capacity of the single-user channel typically grow double- 
logarithmically [5], 0, PHI in the SNR. Also, we assume that the precision of the 
transmitter's estimate of the fading realization is fixed and does not improve with the 
SNR (which could in some scenario be motivated when only a fixed rate feedback link is 
available). The fading is not assumed to be Gaussian and we allow also for memory. 

2 Previous Results 

The Multi-Input-Multi-Output Gaussian Broadcast Channel (MIMO-GBC) has been 
studied extensively in recent years. First the optimal sum-rate has been characterized, 
[T], [T§|, [IE], |22] and that has been followed by the full determination of the capacity 
region [201, which is one of the few examples of a non-degraded broadcast channel for 
which the capacity region has been determined completely. For a short review on this 
subject, see [2]. 

Most of the literature focuses on symmetric Gaussian fading, where the transmitter is 
equipped with M antennas and there are K single-antenna users which are statistically 
equivalent, that is: 

Y = HX + N, (1) 

where Y is a K component vector composed of the received signal at the different K 
users, and H is a (K, M) matrix with IID Gaussian components, X designates the M 
component transmit vector, and N the K components IID additive Gaussian noise vector. 

One of the great attraction of these capacity results is the multiplexing gain, that is 
min(M, K) which remains the pre-log factor of the sumrate, though no cooperation of the 
receivers is allowed in the broadcast setting. This multiplexing gain is the central factor 
impacting the rate region in the high SNR regime. Though unable to provide optimal 
performance, numerous simple precoding techniques do preserve the multiplexing gain, 
and in particular techniques such as vector-perturbation precoding, as well as standard 
channel equalization (zero-forcing or MMSE channel inversion) procedures [Hj, [7j, |21j . 
[2], and references therein. 

All the associated information-theoretic optimal techniques rely heavily on the precise 
availability of the channel propagation matrix, H at both transmitter and all receivers. 
In fact, the underlying element in the optimal information theoretic approach to the 
MIMO-GBC, that is the 'Dirty-Paper' coding [I] renders the whole scheme rather sensi- 
tive to the accuracy of the CSI available at the transmitter. Also suboptimal techniques, 
such as the classical zero-forcing channel-equalizers assume the full availability of CSI 
at the transmitter [7j, j2I]. Already in P, this point has been emphasized, and it has 
been demonstrated that for a symmetric case where all users enjoy statistically equivalent 
channels (as the case here), the rate region of the MIMO-GBC with no CSI available at 
the transmitter corresponds to that of K = 1. The total lack of CSI realization knowl- 
edge, causes the absolute collapse of the multiplexing gain to unity. This observation in 
PP has been extended to address a general isotropic model in In [5] H is assumed 
to be of an isotropic structure that is: H = UG, where G is a diagonal matrix repre- 
senting different channel gains to the different users, while U is a unitary matrix. In 
this case it has been shown that the MIMO-GBC capacity region is equivalent to the 
capacity region associated with a single transmit antenna broadcast channel, reducing 



again the multiplexing gain to unity. The same conclusions remain if the G matrix (i.e. 
gain information) is available at the transmitter site, parallelling the achievable perfor- 
mance in a TDMA-like regime, where the best single user is addressed jS]. It should 
be emphasized that this is in sharp contrast to the single user M transmit K received 
antenna case (equivalent to the MIMO-GBC, where the receivers are allowed to cooper- 
ate). In the latter, the impact of the lack of CSI is well understood and is rather limited 
in the high SNR regime, where optimal singular-value decomposition and water-pouring 
provide marginal advantage in the standard model [T7j, [T2J. See jl] and [T3] for cases 
where special attention is called for. In the standard single user MIMO model the full 
multiplexing gain is typically maintained even without any transmitter CSI. 

The importance and practical implications of partial CSI knowledge, and the appar- 
ent sensitivity of the achievable rate in the MIMO-GBC to accuracy of CSI, motivated 
intensive research work which encompasses by now many dozens of recent contributions. 
Those span now different aspects, such as opportunistic approaches, linear and non-linear 
precoding in the presence of partial CSI, multiple beam-forming subjected to uncertainty 
conditions, feedback constrained signaling, Lattice reduction techniques, scheduling, an- 
tenna selection, and numerous other approaches. See |2j for a short discussion. 

In [HI] h was substantiated that the optimal scaling of the sumrate C(SNR) for 

MIMO-GBC, that is: M x log^SNR x \og{K) \ can be maintained with a rather limited 
CSI. The limited central idea is based on transmission of random orthogonal M beams 
and the CSI comprises the signal-to-interference ratio (SIR) observed at each beam. 
Transmission then is done on each beam to the user that enjoys the maximum SIR. While 

the M x log ^SNR x \og(K) j conditions of fixed M and SNR, while K increases to infinity. 

The multiplexing gain however is defined as limC(SNR)/ log(SNR), SNR — > oo, that is 
fix M and K (though can be chosen as large as wished). Under these circumstances, 
it is not too difficult to realize that the opportunistic scheme in jTH] is interference 
limited, that is: C(SNR)/ log(SNR) — ► 0. This, in principal, is also the case typifying 
many variants of MIMO-GBC with a variety of signaling/coding approached under finite 
precision CSI, that is the elements of the propagation matrix are available with a finite 
precision. This is not to say that any sort of partial CSI inflicts the destruction of 
the degrees of freedom, as is demonstrated by considering the case where perfect CSI 
is available just for judiciously selected subclass of scheduled users. Another example 
considers the availability of spatial information, say U in the isotropic model discussed 
above. It is straightforward to realize that full multiplexing gain can be maintained if 
equalization, say zero forcing, is performed with respect to U alone |21j . 

These methods however do not support finite precision CSI as is in focus here. In fact, 
finite precision could be associated with fixed finite rate feedback channels, as classical 
rate distortion theory imposes a bound on the accuracy of CSI. When the rate of the 
feedback channel may in fact be increased with the increase of SNR, evidently the higher 
the SNR, the higher the CSI precision and the qfull multiplexing gain can be preserved. 
In a recent contribution [6 , a finite-rate feedback is considered, which scales with SNR. 
Multibeam forming is assumed where each user feedbacks the vector source coding of 
its own channel, employing unitary quantization codes. Then transmission is done to a 
randomly selected set of users with different beamformers. It is demonstrated that for 
a feedback rate proportional to log(SNR) the full multiplexing gain is maintained. The 
resultant effect is due to the enhanced accuracy which scales exponentially with SNR 
(proportional to the feedback rate), hence mitigating the interference-limited regime. 



3 Channel Model and Main Result 



We consider a broadcast channel with a transmitting node X and two receiving nodes 
Y and Z. The transmitter employs two antennas so that the transmitted symbol takes 
value in R 2 . The receivers each employ a single antenna, and the received symbols thus 
take value in R. Assume that the sequences of random vectors in R 2 {A k }, {Afc}, {H fc } 
and {H^.} are generated by nature according to some given law. When at time-A; the 
symbol Xfc G R 2 is sent, the received symbol at Terminal Y is then given by 

Y k = A\* k + (2) 

where Afc models the fading and is given by 

A k = A k + A k (3) 

and where {N^} is a sequence of IID scalar zero-mean variance-a 2 real Gaussian ran- 
dom variables modeling the additive noise. (Throughout this paper all Gaussian random 
variables are assumed to be of positive variance.) The random vector A k models the 
transmitter's estimate of the time-A; fading level and Afc denotes the transmitter's esti- 
mation error. That is we assume that the transmitter has access to the sequence {Afc} 
but not to {A k }. The receivers, however, both have access to both {A k } and {A k }. 

The received signal at Terminal Z is analogously defined with similar statistical as- 
sumptions by 

z k = tr fcXfe + ivf } (4) 

and 

Hfc = H fe + Hfc (5) 

where {Hfc} is known to both the transmitter and to the two receivers, and where {Hfc} 
is only known to the two receivers. Thus, the transmitter knows {Afc} and {Hfc} whereas 
both receivers know {Afc}, {Afc}, {Hfc}, and {Hfc}. 

The additive noise sequences {N^} and {N^} are assumed to be independent, each 
being IID N(0,a 2 ). Moreover, ({N ( k z) }, {N^}) is independent of ({A k },{A k },{~k k }, 
{Hfc}) and conditional on ({Afc}, {Hfc}) the joint law of 

({Ni%{Nl%{A k },{H k }) 

is not depending on the input sequence {x fe }. 

We denote the message intended for Terminal Y by My and assume that it is uni- 
formly distributed over the set M.y = {1, . . . , |_e nRy J} where n denotes the blocklength 
and Ry denotes the transmission rate to Terminal Y in nats per channel-use. The objects 
Mz, Aiz, and Rz are analogously defined and we assume that My is independent of Mz- 

We consider an average power constraint 



1 n 

-^E[||Xfc(M y ,M z ,A 1 ,H 1 )|| 2 j <£, (6) 



n 

k 



where Xfc(my, m z , a™ , h") denotes the symbol that is transmitted at time-A; when the 
messages my, mz are to be conveyed and when the transmitter's estimate of the fading 
levels are a™ and h", respectively. (Here A™ stands for Ai, . . . , A n .) We define a rate 



pair (Ry,Rz) to be achievable as in the standard broadcast channel set-up Ch. 14] 
and denote the supremum of Ry + Rz over all achievable pairs (Ry, Rz) by C^ ot [E / 'a 2 ) . 

Since the capacity region of a broadcast channel depends only on the conditional 
marginal distributions [3J Theorem 14.61], there is no loss in generality in assuming, as 
we shall, that conditional on ({A^}, {Hfc}) the processes and {H^} are independent, 
i.e. that 

{A,}^-({A fc }, {H fc })^-{H fc } (7) 

forms a Markov chain. 

The main result of this paper is about the asymptotic behavior of CT 0t (S / a 2 ) as SNR 
tends to infinity: 

Theorem 1. In the above set up, assume that for every k the distribution of A^ is 
the same as the distribution of Ai and that the distribution of is the same as the 
distribution of Hi. Moreover, 



EtllAiH 2 ] ,E[||H X || 2 ] <oo. 



If, additionally, 



and 



AJ,HJ > -oo 



then 



lim -h[A% 

n. — >on ft 



lim ~/i(H? AJ,HJ) > -oo 

n— >oo Tl 



— C Tot (S/a 2 ) 2 

urn - — ; . ON < -. 

s->oo log(l + S/a 2 ) ~ 3 



(8) 

(9) 
(10) 

(11) 



Note that since the transmitter can always choose to transmit to only one receiver, the 
limiting ratio is typically at least 1/2. We conjecture that the limiting ratio is indeed 1/2, 
leading to a "complete collapse of degrees of freedome" [TT] . Note that in the coherent 
case where the transmitter knows the fading precisely this limit is typically 1. It is also 
typically 1, even in the absence of any transmitter side- information, if the receivers can 
cooperate. 



4 Lemmas 

Lemma 1. For every T > let h max (T) denote the supremum of the differential entropies 
of all random variables G taking value in the interval [— tt, tt) and satisfying 



log 4 



101 



(12) 



where log + (£) = max{0,log£} for all £ > and log + (0) is defined to be zero. Then 



-/vax(r) = r - io g r - iog(2e) + 0(1; 



(13) 



where the o(l) term tends to zero as F — > oo. 



Proof. The proof is based on Thm. 11.1.1] according to which the density f*(9) over 
[— 7r,7r) that achieves h mayL (T) has the form 



C for 1 < 101 < 7T 



(14) 

integrates 
□ 



where the constants c > and < a < 1 are chosen to guarantee that /* 
to one over [— n, tt) and that the constraint is satisfied. 

Corollary 1. For every 5 > there exists some M(5) > such that for any random 
variable taking value in an interval of length 2n and having differential entropy h(Q) 



hg+ W\ 



< max{M(S),-{l + S)h{e)}. 



(15) 



Proof. We first note that it suffices to prove the bound for the case where G takes value in 
the symmetric interval [— tt, tt). Indeed, if takes value in any (not necessarily symmetric 
about the origin) length-27r interval then 



"l + 1 " 










< E 


log~ 




g W\. 






|0|_ 



where 

= mod [— 7T, 7r). 

But h(Q) = h(Q) so that the result will follow from the result for 0, which takes value 
in [—7i, 7r). 

We now proceed to prove the result for the case where takes value in [— tv, tx). By 
Lemma ^ there exists some M(S) such that 



-^max(r) > 



1 + 5 



r, r > m{8) 



(16) 



and hence, 

T < max{M(5), -(1 + 5)/i max (r)}. (17) 

The corollary now follows from this inequality by noting that, by the definition of h m3iX 
in the lemma, if G [— 7r,7r) is arbitrary (i.e., not necessarily having the max-entropy 
distribution) then 

1 



h(Q) < K 



log 



101 



□ 



Lemma 2. Let the random variable X be of finite second moment and independent of the 
Gaussian random variable of positive variance U. Let F$(-) and Ft(-) be two distribution 
functions on the real line with finite variances. Then for any distribution function F(-, ■) 



on 



of marginals F(-, +oo) = Fs(-) and _F(+oo, 



we have 



/oo roc /"OO /»oo I 

h(sX + U)F s (s) > h(tX + U)F T (t) - / log + Mf(s, t). 

-oo J —oo J —oo J —oo \S\ 

Consequently, using log + (a/6) < log + (a) +log + (l/6) ; we have 

/oo roc poo poo i 

h(sX + U)F s (s) > h(tX + U)F T (t)- log + \t\F T - log+ — F s 
-oo J —oo J —oo J —oo \"\ 



(19) 



(20) 



Proof. The proof is based on two inequalities: 



h(sX + U)>h(tX + U)-1ogj-\, \t\>\s\ (21) 

\s\ 

and 



h(sX + U) > h(tX + U), \t\<\s\ (22) 
which combine to prove that 



h(sX + U)> h{tX + U)- log + f-^ (23) 



from which the lemma follows by integration. 

Both inequalities follow by noting that a zero- mean variance- (a 2 + b 2 ) random vari- 
able can be expressed as the sum of two independent zero-mean Gaussians of respective 
variances a 2 and b 2 . Indeed, (|21|) follows from: 

h(sX + U) = h fasX + UU - log j| 

= h (tX + ^UJ - log |4 



>h(tX + U)-\og\-j, |4 ^ 1 



1*1 


\t\ 


\s\ 


\s\ 



where the first equality follows from the behavior of differential entropy under scaling 
and the last inequality follows by writing (t/s) ■ U as a sum of two independent Gaussians 
of variances a 2 and (t 2 /s 2 — l)a 2 and by conditioning on the latter. 

Similarly, (}2*2*|) follows for \t\ < \s\ using the Data Processing Inequality: 

/(X; sX + U)> I(X; UsX + U)) 

s ' 

= l(X;tX + -U) 

v s 

>l(X;tX + U), ]4<1 

from which ()22|) follows by expanding mutual information in terms of differential en- 
tropies. Here the first inequality follows because scaling does not change mutual infor- 
mation unless the scaling is by zero, and the last inequality follows by noting that U can 
be written as a sum of independent Gaussians of variances (t 2 /s 2 )a 2 and (1 — t 2 /s 2 )a 2 
and by invoking the Data Processing Inequality. □ 

Lemma 3. Let the random vector W take value in IR 2 and let R and G denote its mag- 
nitude and phase in the sense that R = || W|| ; e [— 7r, 7r) and W T = (Rcos 0, i?sin 0). 
Assume that W is of finite second moment and finite differential entropy. Then 

h{Q) = h{W)-h{R\Q)-E[\ogR] (24) 
> h(W) - h(R) - E[\ogR] (25) 



with equality if, and only if, R and are independent. 



Proof. The equality follows directely from the behavior of differential entropy under 
change of coordinates; see, for example, fUJ Lemma 6.16]. The inequality follows be- 
cause conditioning can only decrease differential entropy unless the random variables are 
independent. □ 

Lemma 4. Let the pair (X,Y) of finite-variance random variables be independent of the 
pair (U, V) where U and V are IID J\f(0, a 2 ) for some a 2 > 0. Let 

H{9) = h((X + U) cos 9 + (Y + V) sinfl), -tt < 9 < tt, 
Ti snp = sup H{0). 

-1T<6<1T 

Then for any distribution function Fq(-) on [—71,11), 

^ H(9)F e (9) > ~W sup + ~ log(27rea 2 ) + inf f log | sm{9 - <f>)\F e (9) (26) 

> \n sup + l - logged 2 ) - log I - 3 max |m(1/2), ~K&)\ (27) 

where M(l/2) is a universal constant that is defined in Corollary^ Using h(Q) < log(27r) 
we have 

max{M(l/2),-^(6) < M(l/2) - h(Q) + ^log27r (28) 

and hence 

J 71 H(9)F e (9)>±H snp + ilog o- 
where 7 is some universal constant. 

Proof. Define TCi n { = in-U<=i-w,n) T~C(Q) an d note that TCi n { > 1/2 log(27recr 2 ). Also, because 
X and Y have finite second moments, H sup < 00. For some fixed 5 > let 6* max and 9 m { n 
be such that 

7^(#min) < Ti-xai + 5, 7~C{9 max ) > H sup — 5. (30) 
For any 9\, 9 2 G [—tt, tt) denote by J{9\, 9 2 ) the joint differential entropy 

J(9 h 9 2 ) = h(cos9 1 (X + U) + sm0 1 (Y + V)) , cos 9 2 (X + U) + sin 9 2 (Y + V)). (31) 

Note that 

J(9i,9 2 )<H(9 1 )+H(9 2 ) (32) 

and that because 

'cos 9 1 (X + U) + sin 9 X (Y + V)\ _ /cos 9 X sin 6»A fX + II s 
cos 9 2 {X + U) + sin 9 2 (Y + V) J ~ \cos 9 2 sin 9 2 J \ Y + V / 

it follows that 

J{9 U 9 2 ) = J(0, tt/2) + log | sin(# 2 - X )|. (33) 
We now have for any 9 G [—tt, tt) 

2H(9)>H(9)+H(9 min )-5 

> J(9,9 min )-5 

= J(0, tt/2) + log | sin(0 - 9^)1 - 6 

= J(9 max , 9 max + tt/2) + log | sin(6» - 9 min ) \ - 5 

> H(9 max ) + - log(2vre(T 2 ) + log | sm(9 - 9 min )\ - 5 

> n sup + - log(2vr e( T 2 ) + log | sin(0 - 9 min )\ - 25 



2| ^(©)-7, (29) 



from which (|26|) follows by letting 8 tend to zero and integrating over 9. Here the first 
inequality follows from the definition of H(9 min ); the subsequent inequality from (J52J; the 
two subsequent equalities from (jHHJ); the subsequent inequality by the chain rule and since 
conditioning reduces entropy and then using that U and V are independent Gaussians; 
and the final inequality by the definition of 9 mSuX . 
To prove (|2*7jl we use the inequalities 



sin£| >mm{ -|£|, -|£ - tt|, -|£ + tt| 

71 71 



71 



-7T < £ < 71 



(34) 



and 



log min{a, b, c} > log a + log b + log c, a, b, c > 

where log - (a) = — log + (l/a) is given by minjloga, 0} for a > and as — oo for a = to 
obtain 



71 



— log | sin(6> — 0) | < log — + log 



+ log H 



7T 



log H 



0- 



7T 



From here ()27|) can be derived from (|2*Bj) as follows: 



inf 

-7r<</><7T /_, 



log|sin(0-0)|F (0) 



> 



sup 

-7T<<7!><7r 



sup 

— 7T<0<7T J— 7T 

7T 



log|sin(0-0)|F (0) 
1 



> log 3 sup E 

2 qsk 



log - + lo, 



log" 1 



10- 



+ log 4 



+ 7T 



+ log H 



7T 



F W 
(35) 



from which (}2Tj) follows using Corollary ^ because for any a 6 K the random variable 
© — a takes value in an interval of length 2tt and has differential entropy h(Q). □ 

Lemma 5. Let the random vectors A and H each take value in M 2 independently of the 
zero-mean variance-a 2 (a 2 > 0) random variable U. Let ||A|| > and B^ G [— 7r,7r) 
denote the magnitude and angle of A in the sense that A T = (||A|| cosB^, ||A|| sinB^)- 
Let X G IR 2 be independent of (A, H, U) and assume that E[|| A|| 2 ], E[||H|| 2 ] and E[||X|| 2 ] 
are all finite. Then, 



h(A T X + U\A) > -h(H T X + U\H) 



; E[log + ||H||] -E 



log" 1 



+ ilo g( x 2 -7 + ^(B A ). 
Moreover, if additionally h(A) > — oo then 

h{A T X + U\A) > hi{B7X + U\H) - ^E[log + ||H||] - 
+\loga 2 - 1+ 9 -(h(A)-h(\\A\\)- 



\A\ 



log 



A 

E [log || A|| 2 ] 



(36) 



(37) 



where 7 is a universal constant. 



Proof. We first use polar coordinates to write 
/i(A T X + £/|A) 4 J /i(a T X + [/)F A (a) 



1 1 ^(r(cos^[l]+sin^[2]) + C/)F ||A|||0A (r|e)F 0A (0). 



Conditional on Ba = # we now apply Lemma EJ with the substitution of cos#X[l] + 
sin6X[2] for X; with the substitution of ||A|| (of law i*]|A|| |e j4 =e) f° r an d with the 
substition of 1 for T to obtain 



h(A J X + U\A) 

> J | h (cos [1] + sin [2] + U) - j log + i F| [A 

h(cos0X[l] + sin ex [2] + tf)F 0A (0) - 



|e A = 



Fe A W 



log 



A 



(3? 



We similarly express /i(H T X + C/|H) using polar coordinates and conditional on 0# = 
9 we apply Lemma El with the substitution of cos$X[l] + sin6X[2] for X; with the 
substitution of 1 for S; and with the substitution of ||H|| (of law F\\n\\ \e H =e) for T to 
obtain 

h(cos9X[l] + sin#X[2] + U) > h(H T X + U\H) - E [log + ||H||] . (39) 

Inequality (j3T?j) now follows from (}3*H|) and by applying ()29|) of Lemma 0] and by noting 
that 

sup h (cos OX [1] + sin 9X [2] + U) > [ h (cos 9X [1] + sin 9X [2] + U) F @H (9) 

\0\<ir J 

and finally applying (|3*9*j). Inequality (J37j) follows from (jHSj) using Lemma El □ 

Corollary 2. Lei X, A and H ta&e value in M 2 u>itn X-^— S^— A and X^— S— o— H 

forming Markov chains and U being jV(0, <t 2 ) distributed and independent o/(X, 5, A, H). 
// E[||X|| 2 ] , E[|| A|| 2 ] , E[||H|| 2 ] < oo and /i(A|S) > -oo, then 

1 



/i(A T X + U\A, S) > i/i(H T X + Z7|H, 5) - - E [log + ||H 



log 



-7 + + \{h{A\S) - fc(||A|| 1 5) - ~E[log ||A|| 2 ]) 



(40) 



>^(H T X + f/|H,5)-^E[log + ||H 



log 4 



1 



IAI 



-7 



+\ log a 2 + ? (/i(A|5) - i log(27reE [|| A|| 2 ] ) - ~ log E [|| A|| 2 ] ) (41) 
where 7 is, as m tae lemma, a universal constant. 

Proof. Inequality (J4*U|) follows directly from the lemma by conditioning on S. Inequality 
(141)1 follows from (|4T)j) by upper bounding ft.(||A|| \S) with h(\\ A||) and by upper bounding 
the latter by the differential entropy of a Gaussian of equal second moment. □ 

Lemma 6. Let X and S be possibly dependent random variables with X satisfying 
E[X 2 ] < 8 and S having a finite second moment. Then 

2 

where A is upper bounded by log(e)/e. 



log(^re(,S 2 E[X 2 | S]+a 2 )j\ < - log(27re(E [S 2 ] £ + a 2 )) + A (42) 



Proof. This is a fairly standard result in "optimal power allocation" [3J Sec. 10.4 & 10.5]. 
The maximum of the left hand side (LHS) of (J42|) over all choices for E[X 2 | S] integrating 
to £ has been extensively studied in the literature. Schein & Trott jTH] and Zamir & Erez 
23 have, for example, shown that the choice E[X 2 | S] = £ can be off from the optimum 
by at most log(e)/e from which the firm bound follows using Jensen's inequality. 

□ 

5 Proof of the Theorem 

Assume a sequence of encoding and decoding functions for the setting described in Sec- 
tion |3] of rates Ry and Rz such that the average probabilities of errors as defined in the 
standard broadcast set-up tend to 0. Given such a block-length n encoder define Y™ and 
Z x to be the n- length vectors of the outputs Yi, . . . , Y n and Z±, . . . , Z n , respectively. Sim- 
ilarly define A™, H", A™ and H™ to be the n-length sequences of fading coefficients. Using 
Fano's Inequality [3] and assuming that Terminal Z knows the message My intended for 
Terminal Y, one obtains an upper bound on the rates 

R Y < -I(M Y ; *7|AJ, A", H") + e(n), 
n 

R z < -I(M Z ; Z?\M Y , H?, A?, H?) + e(n), (43) 



where the term e(n) tends to as n tends to infinity. 



The rate Ry can be further upper bounded using the following lower bound 
/i(Y?|M y ,A?,A?,H?) 

n 

= {h(Y£, Z? +1 \M Y , A\, H? +1 , A™, H") - h{Yt\ Z?\M Y , A*" 1 , H™, A™, H")| 
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+/i(Zr|My,H?,A?,HJ) 

^i+i, M Y , A\, H" +1 , A", H") 
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+ - ( h{Ai + A \y;-\ z? +1 , m y , a\-\ H? +1 , A?, H? 



_ 2 log ( 2717 E 



-2 bgE 



n 1 1 

> ^{--^|zr +1 ,M y ,Hr, a?,h?) - -i 



log+HHi + Hi 



8=1 
1 



log" 1 



9 



4 

-- log E 

2 5 



+ bg a 2 - 7 + - /i(A| A\-\ A", H?) - - log 2vreE 



Aj + Aj 



+ / i (^|My,H 1 ,A 1 ,H 1 ) 



9n 



|Aj + Ajl 



9 
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nE 



log^ 



|Ai + Ad 



9n 



IokE 



|Ai + Ai| 



(44) 
H 



(45) 



I A, +A,; 



(46) 



(47) 



where 7 is a universal constant. Here Equation (}44j) is obtained by subtracting the term 
EILi Mn^f+il M Y , A\, H? +1 , A?, H") and then adding £^1 ^^My, Af 1 , H-\ 
A", H") where both sums are equal because (Aj, Hj) 



-(My,Ar\H? +1 ,A?,H?; 



i-1 



HI 



(I? -1 , Z™ +1 ) forms a Markov chain. In the following let Sj = (X , Z? +v M Y , A l { . 
A l5 H"). Then the Markov chains (Aj + A^-o-^-o- Xj and (Hj + Hj-o-Si-o- 
Xj are fulfilled and h(Ai + Aj|Sj) > —00 holds due to Q and the Markov Chain 
Aj^— (^A l f 1 , A", H™ J — o— Sj. Thus Corollary EJ can be applied and Inequality (jUj) 

follows. In Inequality (|46|) we used that differential entropy cannot be increased by con- 
ditioning and is invariant under shift together with the fact that Zj—o— (Si, Hj)— o— H^ 1 



and A-o-(A J f \ A?, H 



-Si form Markov chains. Finally, Inequality (}4Tj) follows 



due to the chain rule and because the marginal distributions of Aj and Hj do not depend 
on the index i. Note that the Markov chains used to justify these inequalities follow 
mainly from Assumption (J7|) and because the transmitter has access only to A" and H" 
but not to A" or to H". 



From this follows that 

R Y < -h{Y?\k n u A?, H?) - -U(Zr|M y ,H?, A?,H?) - ^-/i(A"|A", H") - - log a 2 
n An in 4 



4 E 



log + || Hi + Hi | 



log H 



+ 7 



||Ai + Ai| 

|Ai + Ai| |2] 
1 

n n 
Using the chain rule for differential entropy and the fact that differential entropy 



+ ^log(27re) + ^logE 
i* z <-A(Zr|M Y ,H?,A?,H£ 



+ e(n), 

-/i(^|M y , M z , H?, A™, H») + e(n) 



(48) 
(49) 



cannot decrease if conditioning is removed we have 

n n 

h(Y?\ A?, A™, H») = £ M^r 1 , A?, A", H?) < £ h(y 4 | A, A). 



(50) 



i=i i=i 

The terms on the right hand side of (joTIj) can then be upper bounded using the fact that a 
Gaussian distribution maximizes differential entropy under a second moment constraint, 
followed in a second step by using the Cauchy-Schwarz Inequality and by splitting up the 
expectation over Aj + A; into the expectations over the magnitude and phase of Aj + A, 
and applying Jensen's Inequality to the expectation over the phase. Further, Lemma 
El can be applied since E[||Aj|| 2 ] < oo for every i due to Condition (jSJ) and by defining 
Si = E[||Xj|| 2 ] which is finite according to (|B|). The last inequality follows by using again 
Jensen's inequality and because the distribution of Aj + Aj is assumed not to depend on 
the index i: 
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(51) 



Furthermore since the random vector X& is determined by the messages My, Mz and by 
the fading coefficients A" and H™ and since the sequence n[ z \ . . . , is IID jV(0, a 2 ) 
it follows that 

h(Z?\M Y ,M z ,HZ, A?, H?) = -log(27r«r 2 ). (52) 
Combining the bounds (f4"Kj) and (jjQj) and applying (JHU|) . (f5*T]) and (JH2*|) one obtains 



#r + ^Rz < ]z log 
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(53) 



where 7' = 7 + ^ + f log(27re). 

By exchanging the roles of the two receiving terminals in all the steps an upper bound 
on \Ry + Rz is obtained. This upper bound summed up with Bound (J55|) and divided 
by § yields 
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(54) 



Taking now the limit as n tends to infinity the term e(n) tends to and the sum rate 
can be upper bounded by 



R Y + Rz<^\og I 1 
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(55) 



Theorem ^ follows then by taking the limit of the ratio of the right hand side of (|55j) 
to log (l + -4sJ as £ tends to infinity since due to Assumptions (JBJ), © an d (fTTH) all the 
additive terms except for the first two in expression (J55|) are bounded for every value of £. 
In particular the terms on the second last line are finite since conditioning on Ai and Hi, 
respectively, fDJ Lemma 6.7 c)] shows that the terms are finite if /i(Aj.|Ai) and /i(Hi|Hi) 
are finite which follows by Assumptions and (jlOj) . Furthermore it can be showed using 



Jensen' Inequality that the terms E 



log+HHi+Hil 



and E 



log + ||Ai + Ai| 



whenever E[||Ai|| 2 ] and E[||Hi|| 2 ] are finite which holds due to 



stay finite 
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